Multi-armed bandit - PDFSEARCH.IO - Document Search Engine

Multi-armed bandit
Results: 113

#	Item
81	Efficient Regret Bounds for Online Bid Optimisation in Budget-Limited Sponsored Search Auctions. Long Tran-Thanh (1), Lampros Stavrogiannis (1), Victor Naroditskiy (1), Valentin Robu (1), Nicholas R Jennings (1) and Peter Key (2). 1: Univ | Source URL: research.microsoft.com | Language: English | Date: 2014-06-20 12:31:45 | Tags: Analysis of algorithms, Estimation theory, Normal distribution, M-estimator, Time complexity, Asymptotically optimal algorithm, Algorithm, Multi-armed bandit, Statistics, Theoretical computer science, Applied mathematics
82	Linear Submodular Bandits and their Application to Diversified Retrieval. Carlos Guestrin, Machine Learning Department, Carnegie Mellon University | Source URL: www.select.cs.cmu.edu | Language: English | Date: 2011-10-28 13:54:18 | Tags: Machine learning, Submodular set function, Operations research, Theoretical computer science, Natural language processing, Mathematical optimization, Multi-armed bandit, Algorithm, Linear programming, Statistics, Applied mathematics, Mathematics
83	Selecting the State-Representation in Reinforcement Learning. Odalric-Ambrym Maillard, INRIA Lille - Nord Europe [removed] | Source URL: eprints.pascal-network.org | Language: English | Date: 2011-11-02 05:20:38 | Tags: Markov processes, Dynamic programming, Markov decision process, Stochastic control, Distribution, Multi-armed bandit, Statistics, Mathematical analysis, Generalized functions
84	Beat the Mean Bandit. Yisong Yue, H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, USA; Thorsten Joachims, Department of Computer Science, Cornell University, Ithaca, NY, USA | Source URL: www.yisongyue.com | Language: English | Date: 2011-05-11 16:32:53 | Tags: Number theory, Machine learning, Multi-armed bandit, Stochastic optimization, Normal distribution, Valuation, Factorial, Mathematics, Statistics, Mathematical analysis
85	Linear Submodular Bandits and their Application to Diversified Retrieval. Carlos Guestrin, Machine Learning Department, Carnegie Mellon University | Source URL: www.yisongyue.com | Language: English | Date: 2011-10-28 13:51:45 | Tags: Machine learning, Submodular set function, Operations research, Theoretical computer science, Natural language processing, Mathematical optimization, Multi-armed bandit, Algorithm, Linear programming, Statistics, Applied mathematics, Mathematics
86	Latent Bandits. Odalric-Ambrym Maillard (odalric-ambrym.maillard@ens-cachan.org), The Technion, Faculty of Electrical Engineering [removed] Haifa, Israel; Shie Mannor, The Technion, Faculty of Electrical Engineering 320 | Source URL: jmlr.org | Language: English | Date: 2014-02-16 19:30:21 | Tags: Applied mathematics, Asymptotic analysis, Big O notation, Mathematical notation, Asymptotically optimal algorithm, Algorithm, Multi-armed bandit, Analysis of algorithms, Mathematics, Statistics
87	Thompson Sampling for Complex Online Problems | Source URL: jmlr.org | Language: English | Date: 2014-02-16 19:30:21 | Tags: Econometrics, Statistical inference, Machine learning, Confidence interval, Multi-armed bandit, Thompson sampling, Reinforcement learning, Bayes estimator, Dimensional analysis, Statistics, Measurement, Estimation theory
88	Incentivizing Exploration. Peter Frazier, Cornell University, Ithaca NY; David Kempe, University of Southern California, Los Angeles CA; Jon Kleinberg, Cornell University, Ithaca NY; Robert Kleinberg, Cornell University, Ith | Source URL: www.cs.cornell.edu | Language: English | Date: 2014-05-07 00:31:23 | Tags: Gittins index, Probability theory, Multi-armed bandit, Risk-neutral measure, Mechanism design, Statistics, Decision theory, Design of experiments
89	A Contextual-Bandit Approach to Personalized News Article Recommendation. WWW 2010, Full Paper, April 26-30, Raleigh, NC, USA | Source URL: www.research.rutgers.edu | Language: English | Date: 2010-05-02 03:42:39 | Tags: Machine learning, Cybernetics, Theoretical computer science, Multi-armed bandit, Stochastic optimization, Reinforcement learning, Greedy algorithm, Recommender system, Algorithm, Statistics, Mathematics, Applied mathematics
90	All learning is local: Multi-agent learning in global reward games. Yu-Han Chang, MIT CSAIL, Cambridge, MA 02139 | Source URL: people.csail.mit.edu | Language: English | Date: 2004-07-01 07:47:52 | Tags: Markov processes, Stochastic control, Robot control, Reinforcement learning, Q-learning, Markov decision process, Kalman filter, Multi-armed bandit, Machine learning, Statistics, Markov models, Dynamic programming
